Construction of speech corpus in moving car environment
نویسندگان
چکیده
The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting speech corpora in moving cars which are made available as resources to advance the research and development of robust ASRs and spoken dialogue systems under high-noise conditions. The speech corpus consists of (1) phonetically balanced sentences, (2) digit strings, (3) discrete words and (4) transcribed spoken dialogues between drivers and information systems for navigation and information retrieval. These data are collected in vehicles under both idling and driving situations. The language of the corpus is currently Japanese. The number of subjects is currently about 300, total recording time is over 200 hours and total corpus size is about 160GByte. We have also been recording video images from three di erent angles, vehicle-control signals, and vehicle location, all synchronized with the speech recording. We report the objective of the speech corpus, the recording methods and the recording vehicle developed.
منابع مشابه
Analysis of drivers' speech in a car environment
In order to accelerate the promotion of speech recognition systems to the public; understanding characteristics of speech in real environments is one of the most important issues. This paper reports variations of speech characteristics in a car environment. To analyze speech characteristics in the specific environment, a corpus, recorded carefully in terms of equality of utterances and conditio...
متن کاملConstruction and Evaluation of a Large In-Car Speech Corpus
In this paper, we discuss the construction of a large in-car spoken dialogue corpus and the result of its analysis. We have developed a system specially built into a Data Collection Vehicle (DCV) which supports the synchronous recording of multichannel audio data from 16 microphones that can be placed in flexible positions, multichannel video data from 3 cameras, and vehicle related data. Multi...
متن کاملConstruction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development
In spoken dialogues, if a spoken dialogue system does not respond at all during user’s utterances, the user might feel uneasy because the user does not know whether or not the system has recognized the utterances. In particular, back-channel utterances, which the system outputs as voices such as“yeah”and“uh huh”in English have important roles for a driver in in-car speech dialogues because the ...
متن کاملMulti-Dimensional Data Acquisition for Integrated Acoustic Information Research
The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting various kinds of speech corpora for both of acoustic modeling and speech modeling. The corpora include multi-media data collection in moving-car environment, collection of children's voice while video gaming, room acoustics at multiple points, head related transfer functions of multiple subj...
متن کاملSome Results on the Development of a Hands-free Speech Recognizer for Car-environment
This paper describes some activities being conducted at IRST with the aim of developing a technology for hands-free speech recognition in car environment. This technology is based on Hidden Markov Models and is being developed and evaluated by using the car database collected in the European projects SpeechDatCar and VODIS-II. Preliminary experiments are based on the use of filtered clean speec...
متن کامل